Model Selection

Multi-dialect support

# Multi-dialect support

Roest Wav2vec2 1B V2

This is Denmark's most advanced speech recognition model, trained by Alvenir as part of the CoRal project, based on the CoRal-v2 dataset, covering various Danish dialects.

Speech Recognition Other

Roest Wav2vec2 315m V2

Denmark's state-of-the-art speech recognition model trained by Alvenir, based on the CoRal-v2 dataset, supporting multiple Danish dialects

Speech Recognition

Safetensors Other

Nllb1.3 Smugri4 V0.01

This is a version of the NLLB-1.3b model fine-tuned with parallel data for 29 Finno-Ugric languages, supporting the generation of multiple dialects/variants.

Machine Translation

Transformers Supports Multiple Languages

Wav2vec LnNor IPA Ft

A phoneme recognition model fine-tuned based on wav2vec2-base, supporting English speech to International Phonetic Alphabet (IPA) conversion

Speech Recognition English

Uzbek automatic speech recognition model fine-tuned from OpenAI Whisper Medium

Speech Recognition

Transformers Other

Arabic Retrieval V1.0

A high-performance Arabic information retrieval model built on the sentence-transformers framework, optimized for the richness and complexity of the Arabic language.

Text Embedding Arabic

Nb Whisper Large Distil Turbo Beta

A lightweight and accelerated version of the Norwegian automatic speech recognition model developed by the National Library of Norway, reducing parameter count through distillation while maintaining transcription quality.

Speech Recognition

Transformers Supports Multiple Languages

Whisper Large V3 Turbo Cantonese Yue English

A Cantonese and English mixed speech recognition model optimized based on the Whisper architecture, supporting high-precision bilingual transcription

Speech Recognition

Whisper Tiny Myanmar

This model is an automatic speech recognition (ASR) model fine-tuned on Burmese speech datasets based on openai/whisper-tiny, supporting Burmese speech-to-text tasks.

Speech Recognition

Transformers Other

Arabic Alphabet Speech Classification

This is a transformers model for Arabic alphabet speech classification, capable of recognizing and classifying the pronunciation of Arabic letters.

Audio Classification

Nepali male voice synthesis model based on VITS architecture, supporting high-quality text-to-speech functionality

Speech Synthesis

Transformers Other

Speech Accent Pt Br Classifier

A speech-based accent classifier for distinguishing Brazilian Portuguese from other accents.

Audio Classification

Transformers Supports Multiple Languages

Mms Tts Nova Train

This is a Shan language text-to-speech (TTS) model designed to convert Shan text into natural speech.

Speech Synthesis

Transformers Other

Adabtranslate Darija

A translation model for Darija (Moroccan Arabic) to Modern Standard Arabic (MSA), trained on 26,000 manually annotated and GPT-4 enhanced text pairs

Machine Translation

Nb Whisper Base

An automatic speech recognition model developed by the National Library of Norway, based on the OpenAI Whisper architecture, supporting transcription in Norwegian and English.

Speech Recognition

Nb Whisper Large

An automatic Norwegian speech recognition model launched by the National Library of Norway, developed based on OpenAI's Whisper architecture, supporting multiple Norwegian dialects and English.

Speech Recognition

Transformers Supports Multiple Languages

Arabic Morocco Speech To Text

Arabic speech recognition model based on Whisper-large-v3, optimized for Moroccan accent

Speech Recognition

Transformers Arabic

Nb Whisper Large Verbatim

Norwegian automatic speech recognition model developed based on OpenAI Whisper, with additional training for lowercase, punctuation-free verbatim transcription

Speech Recognition Supports Multiple Languages

Nb Whisper Large

An automatic speech recognition model developed by the National Library of Norway, based on the Whisper architecture, supporting speech transcription and translation of Norwegian and English.

Speech Recognition

Malaysian Whisper Base

Whisper base model fine-tuned on Malaysian datasets, supporting Malay and English speech recognition

Speech Recognition

Transformers Supports Multiple Languages

NorBERT 3 xs is a BERT model optimized for Norwegian, the smallest version in the new generation NorBERT language model series with 15M parameters.

Large Language Model

Transformers Other

NorBERT 3 is a next-generation Norwegian language model based on the BERT architecture, supporting both Bokmål and Nynorsk written Norwegian.

Large Language Model

Transformers Other

Whisper Large V2 Hausa

This model is a fine-tuned version of OpenAI's Whisper Large-V2 for Hausa speech recognition tasks, trained on the Common Voice 11.0 dataset

Speech Recognition

Transformers Other

Whisper Small Kab

Georgian automatic speech recognition model fine-tuned based on OpenAI Whisper-small

Speech Recognition

Transformers Other

Whisper Large V2 Malayalam

This is a fine-tuned version of the OpenAI Whisper Large V2 model for Malayalam speech recognition tasks, trained using the Common Voice 11.0 dataset

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Spanish Ep5 944h

An acoustic model for Spanish automatic speech recognition, fine-tuned for 5 epochs based on facebook/wav2vec2-large-xlsr-53 using approximately 944 hours of Spanish data.

Speech Recognition

Transformers Spanish

carlosdanielhernandezmena

Wav2vec2 1b Npsc Nst Bokmaal

This model is an automatic speech recognition (ASR) model fine-tuned on the Norwegian Bokmål dialect speech dataset based on facebook/wav2vec2-xls-r-1b

Speech Recognition

Opus Mt Tc Big En Pt

This is a neural machine translation model for English to Portuguese (including Brazilian Portuguese), part of the OPUS-MT project.

Machine Translation

Transformers Supports Multiple Languages

Wav2vec2hindiasr

Hindi automatic speech recognition (ASR) model based on Wav2Vec2 architecture, fine-tuned on public speech datasets

Speech Recognition

Automatic speech recognition model trained on a large-scale Arabic speech dataset

Speech Recognition

Wav2vec2 Large Xlsr Hindi

A Hindi automatic speech recognition model fine-tuned on low-resource Indian language datasets based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Breton

A Breton fine-tuned speech recognition model based on facebook/wav2vec2-large-xlsr-53

Speech Recognition Other

Wav2vec2 Large Xls R 300m Urdu

This is an automatic speech recognition model fine-tuned on the Urdu Common Voice 7 dataset based on facebook/wav2vec2-xls-r-300m.

Speech Recognition

Transformers Other

Wav2vec2 Xls R Hindi

This is an automatic speech recognition (ASR) model fine-tuned on the Hindi Common Voice 7.0 dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Breton

A speech recognition model fine-tuned on the Breton Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Punjabi

This is a Punjabi automatic speech recognition model fine-tuned on the Common Voice dataset based on Harveenchadha/vakyansh-wav2vec2-punjabi-pam-10

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Coraa Portuguese Cv8

A Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Edresson/wav2vec2-large-xlsr-coraa-portuguese

Speech Recognition

Wav2vec2 Xlsr Basaa

This model is an automatic speech recognition model fine-tuned on the Common Voice 8 Basaa dataset based on facebook/wav2vec2-xls-r-1b.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Assamese Cv8

This is an automatic speech recognition (ASR) model fine-tuned on Assamese datasets based on the facebook/wav2vec2-xls-r-300m model

Speech Recognition

Transformers Other

Wav2vec2 Xlsr Tatar

This model is an automatic speech recognition model fine-tuned on Tatar language datasets based on facebook/wav2vec2-xls-r-1b, achieving a word error rate (WER) of 16.87% on the Common Voice 8 dataset.

Speech Recognition

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase